Certificate of Mailing 
Date of Deposit: December 1. 2003 Label N umber: EV247566448US 

I hereby certify under 37 C.F.R. §1.10 that this correspondence is being deposited with the United States Postal Service as 
"Express Mail Post Office to Addressee" with sufficient postage on the date indicated above and is addressed to Mail Stop Patent 
Application, Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450. 



Guv Beardslev v 

Printed name of person mailing correspondence Signature of 'person railing corresponded 



f'person rwailing correspondence 



APPLICATION 
FOR 

UNITED STATES LETTERS PATENT 

APPLICANT FAHRI SAATCIOGLU 



TITLE DIFFERENTIALLY EXPRESSED GENES IN 

PROSTATE CANCER 



F:\50218\50218.002004APPLICATION COVER SHEET.DOC 



1 - 



DIFFERENTIALLY EXPRESSED GENES IN PROSTATE CANCER 

This application claims the benefit of U.S. provisional application number 60/135,325 
filed May 20, 1999, and U.S. provisional application number 60/135,333 filed May 20, 1999, 
both of which are incorporated herein by reference in their entirety. 

5 Field of The Invention 

The field of the invention is neoplastic diseases, arid especially detection and therapy of 
prostate cancer. 

Background of The Invention 

Prostate cancer has become the most commonly diagnosed malignancy in males in the 
10 western world, and is the second most common cause of cancer death among men in Europe and 
the United States (Boring, C.C., Squires, T.S., and Tong, T. (1993). Cancer J. Clin. 43, 7-26; 
Carter, H.B., Pianpadosi, S., and Isaacs, J.T. (1990). J. Urol. 143, 742-746). Worse yet, in recent 
years the annual incidence rate of newly diagnosed prostate cancer, as well as the number of 
prostate cancer deaths continuously rose. 

15 Androgens not only play a key role in the development and maintenance of the normal 

prostate, but also in the initiation and progression of prostate cancer (Moore, R.A. (1944). 
Surgery 16, 152-167; Huggins, C, and Johnson, M.A. (1947). J. Am. Med. Assoc. 135, 1 He- 
ll 52). For example, androgens typically induce cell proliferation and inhibit cell death in the 
healthy prostate gland. Withdrawal of androgens stops proliferation of cells and induces apop- 

20 tosis with concomitant involution of the prostate gland. Involution upon androgen withdrawal is 
generally a characteristic of a normal prostate gland as well as of a prostate tumor in the early 
stages of the disease, when the tumor still remains androgen dependent. Consequently, androgen 
withdrawal treatment is commonly used to reverse tumor growth. However, in the case of many 
prostate tumors, the tumor recurs after a few months or years almost invariably in an androgen 

25 insensitive state. At this point, successful therapy of prostate cancer is difficult and prognosis for 
survival is usually relatively low. 

Almost all of the biological effects of androgens are mediated by the androgen receptor, a 
honnone-activated transcription factor. Even though the androgen receptor was cloned over ten 
years ago, the mechanisms by which the androgen receptor regulates gene expression is not well- 
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understood. Furthermore, only a very few of its target genes have been identified, including the 
prostate specific antigen (PSA), the related glandular kallikrein 2 (hKLK2), and the androgen 
receptor itself. Another androgen regulated protein, a secreted serine protease with prostate 
restricted expression, termed "prostase" was recently described by Nelson et al. (Nelson, P.S., et 
5 al. (1999) Proc. Natl Acad. Set 96, 3114-3119). However, the biological functions of the 
proteins coded by the PSA, hKLK2 and prostase genes are poorly understood at present. 

Adding to the difficulties in understanding the role of androgens in prostate cancer is that 
in vivo and in vitro model systems frequently do not closely mimic the human disease. 
Furthermore, close homologues of the PSA gene, the only marker for prostate cancer in human, 

10 are not known in other animal species. Moreover, in vitro studies are hampered due to the 

limited number of cell lines that have been derived from human prostate. For example, the only 
androgen sensitive and androgen responsive cell line that is widely used is LNCaP, characterized 
by cells originally derived from a lymph node metastasis of a human prostate carcinoma 
(Horoszewicz, J.S., Leong, S.S., Kawinski, E., Karr, J.P., Rosenthal, H., Chu, T.M., Mirand, 

15 E.A., and Murphy, G.P. (1983). Cancer Res. 43, 1809-1818). 

In spite of numerous studies on the effects of androgens in the role of prostate cancer, 
relatively little is known about the molecular genetic effects of androgens in prostate cells. More 
detailed knowledge about androgen responsive genes and their role in signal transduction, as 
well as gross morphological and physiological transformations will potentially result in better 
20 diagnostic tools, and provide possible new targets for a drug-based therapy of prostate cancer. 
Therefore, there is still a need to provide improved methods to identify androgen responsive, 
differentially expressed genes in prostate cancer. 

Summary of the Invention 

The present invention is directed to differentially expressed genes in neoplastic cells, and 
25 particularly relates to hormone depehdent genes in prostate cancer. The polynucleotides with the 
SEQ ID NO:l - SEQ ID NO:7 encode an intracellular protein, and while the corresponding poly- 
peptides SEQ ED NO:l 1 - SEQ ID NO: 14 have an intracellular location, the corresponding ex- 
pression products SEQ ID NO:8 - SEQ ID NO: 10 have predominantly perinuclear, nuclear and 
predominantly nuclear localization within a cell, respectively. 



In one aspect of the inventive subject matter, SEQ ID NO:l - SEQ ID NO:7 are expres- 
sed in prostate cancer cells in a hormone dependent manner. 

In another aspect of the inventive subject matter, a method of detecting a neoplastic cell 
in a system includes a step in which a predetermined amount of an RNA comprising at least one 
of SEQ ID NO: 15 - SEQ ID NO:21 is correlated with the presence of a neoplastic cell, and the 
predetermined amount, or more, is subsequently detected in the system. Contemplated detection 
methods preferably employ a labeled probe that is detectable via fluorescence detection, 
luminescence detection, scintigraphy, autoradiography, or formation of a dye. Alternative 
preferred detection methods include addition of at least one nucleotide to the probe (e.g., PCR, 
LCR). 

In a further aspect of the inventive subject matter, a method of detecting a neoplastic cell 
in a system includes a step in which a predetermined amount of a polypeptide comprising at least 
one of SEQ ID NO:8 - SEQ ID NO:14 is correlated with the presence of a neoplastic cell, and 
the predetermined amount, or more, is subsequently detected in the system. Contemplated detec- 
tion methods preferably employ a labeled probe that is detectable via fluorescence detection, 
luminescence detection, scintigraphy, autoradiography, or formation of a dye, and preferred 
probes include antibodies, antibody fragments, and natural and synthetic ligands of the polypep- 
tide. 

In a still further aspect of the inventive subject matter, a method of identifying differenti- 
ally expressed genes in a target tissue has one step in which a target tissue-specific cDNA library 
is prepared by suppression subtractive hybridization, and a plurality of genes from the library is 
immobilized on a solid phase. Nucleic acid preparations from treated and untreated target tissue 
are individually hybridized with array, respectively, and hybridization patterns are compared to 
identify differentially expressed genes. 

Various objects, features, aspects and advantages of the present invention will become 
more apparent from the following detailed description of preferred embodiments of the inven- 
tion, along with the accompanying drawing. 
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Brief Description of The Drawing 

Figures 1 A and IB depict schematic nucleic acid and amino acid based multiple 
sequence alignments of SEQ ID NO: 1 - SEQ ID NO:7 and SEQ ID NO:8 - SEQ ID NO: 13, 
respectively. 

5 Figure 2 is a photograph of an exemplary reverse northern blot of several clones from a 

cDNA library of androgen treated prostate cancer cells. 

Figure 3 is a photograph of an exemplary multiple tissue northern blot of one of the 
isolated polynucleotides. 

Figure 4 is a photograph of an agarose gel after electrophoretic separation of splicing 
1 0 variants of SEQ ID NO: L 

Figure 5 is a series of photomicrographs illustrating the intracellular localization of GFP- 
fusion proteins of polypeptides of SEQ ID NO:8 - SEQ ID NO:10. 

Figure 6 is an autoradiograph of a northern blot indicating hormone dependent 
expression of SEQ ID NO: 1 - SEQ ID NO:7. 

15 Detailed Description 

As used herein, the term "intracellular protein" refers to a protein that is expressed and 
retained within a cell irrespective of its subcellular localization. For example, DNA polymerase 
(nucleus), glyceraldehyde-3 -phosphate dehydrogenase (cytoplasm), and cytochrome c oxidase 
(mitochondria) are all considered intracellular proteins. In contrast, the term "extracellular 
20 protein" refers to a protein that is exported from a cell. There are various mechanisms by which a 
protein can be exported from a cell, all of which are contemplated herein. For example, while 
many exported proteins have a signal sequence that allows specific export of the protein across a 
cell membrane, other proteins are exported through vesicles via the endoplasmatic reticulum, etc. 

' As also used herein, a protein has a "predominantly perinuclear localization" when a 
25 majority of the protein {i.e., more than 50% of the total amount as fluorimetrically detectable by 
GFP-fusion) is located in a volume around the nucleus that does not exceed a volume greater 
than three volumes of the nucleus. A protein has a "predominantly nuclear localization" when a . 
majority of the protein is located within the nucleus. In contrast, the term "nuclear localization" 



means that substantially all of the detectable protein is located within the nucleus. Localization 
of a protein can be determined fluorimetrically by GFP-fusion (GFP: green fluorescence 
protein). 

Employing a novel combined SSH-reverse northern blot procedure (first disclosed in 
5 provisional application 60/1 35,325 filed May 20, 1 999), we isolated seven cDNAs (SEQ ED 
NO:l - SEQ ID NO:7) from human prostate cancer cells. All seven cDNA molecules code for 
an intracellular protein, which is expressed in an androgen and glucocorticoid dependent manner. 
Remarkably, the sequences SEQ ID NO: 1 - SEQ ID NO:7 show a high degree of 
homology/identity with prostase, a previously reported serine protease (Nelson, P.S., et al. 
10 (1999) Proc. Natl. Acad. Sci. 96, 31 14-3119). However, in comparison with the prostase, all of 
the sequences SEQ ID NO:l - SEQ ID NO:7 lack a 49 amino acid N-terminal portion correspon- 
ding to the first exon of the prostase. It should be especially appreciated that the first exon of the 
prostase not only includes structurally important amino acids, but also includes a signal peptide 
sequence that renders the serine protease a secreted, extracellular enzyme. Consequently, the 
15 cDNA molecules with the sequence of SEQ ID NO:l - SEQ ID NO:7 encode intracellular 

proteins that are functionally and structurally different from the prostase, and are therefore inde- 
pendent and novel genes. 

With respect to the base composition of SEQ ID NO: 1 - SEQ ID NO:7, nucleotide se- 
quences other that SEQ ID NO:l - SEQ ID NO:7 are also contemplated and particularly include 

20 variations of SEQ ID NO:l - SEQ ID NO/.7 that include point mutations, insertions, deletions 
and any reasonable combination thereof, so long as alternative sequences encode an intracellular 
protein. Therefore, polynucleotides are contemplated that have at least 80%, preferably at least 
85%, more preferably at least 90% and most preferably at least 95% identity with the sequences 
of SEQ IDNO:l - SEQ IDNO:7, so long as contemplated polynucleotides encode an intracellular 

25 protein. 

For example, point mutations may arise at any position of the sequence from an apurinic, 
apyrimidinic, or otherwise structurally impaired site within the cDNA. Alternatively, point 
mutations may be introduced by random or site-directed mutagenesis procedures (e.g., oligonu- 
cleotide assisted or by error prone PCR). Likewise, deletions and/or insertions may be 
30 introduced into the sequences, and particularly preferred insertions comprise 5*- and/or 3'-fusions 
with a polynucleotide that encodes a reporter moiety or an affinity moiety. Other particularly 



preferred insertions comprise a nucleic acid that further includes functional elements such as a 
promoter, enhancer, hormone responsive element, origin of replication, transcription and 
translation initiation sites, etc. It should especially be appreciated that where insertions with one 
or more functional elements are present, the resulting nucleic acid may be linear or circular (e.g., 
transcription or expression cassettes, plasmids, etc.). 

Still further contemplated variations include substitution of one or more atoms or chemi- 
cal groups in the sequence with a radioactive atom or group. For example, where contemplated 
cDNAs are employed as a hybridization-specific probes, a fluorophor or enzyme (e.g., (3-galacto- 
sidase for generation of a dye, or luciferase for generation of luminescence) may be coupled to 
the sequence to identify position and/or quantity of a complementary sequence. Alternatively, 
where contemplated cDNA molecules are utilized for affinity isolation procedures, the cDNA 
may be coupled to a molecule that is known to have a high-affinity (i.e., K^KTW') partner, 
such as biotin, or an oligo-histidyl tag. In another example, one or more phosphate groups may 
be exchanged for a radioactive phosphate group with a 32 P or 33 P isotope to assist in detection 
and quantification, where the radiolabeled cDNA is employed as a hybridization probe. 

It is also contemplated that the polypeptides (encoded by SEQ ID NO:l - SEQ ID NO:7) 
having the peptide sequence of SEQ ID NO:8 - SEQ ID NO:14 may be produced in vivo or in 
vi/ro, and may be chemically and/or enzymatically modified. Contemplated polypeptides can be 
isolated from prostate tissue or prostate cancer cells that may or may not be in a hormone depen- 
dent state. Alternatively, and especially where larger amounts (i.e., >10mg) are desirable, recom- 
binant production (e.g., in a bacterial, yeast, insect cell, or mammalian cell system) may advanta- 
geously be employed to generate significant quantities of contemplated polypeptides. 

It should further be appreciated that recombinant production not only offers a more eco- 
nomical strategy to produce contemplated polypeptides, but also allows specific modification in 
the amino acid sequence and comppsition to tailor particular biochemical, catalytic and physical 
properties. For example, where increased solubility of contemplated polypeptides is desirable, 
one or more hydrophobic amino acids may be replaced with hydrophilic amino acids. Alternati- 
vely, where reduced or increased catalytic activity is required, one or more amino acids may be 
replaced or eliminated. In still another example, fusion proteins with contemplated proteins are 
) contemplated, in which an additional polypeptide is added to the N-terminus and/or C-terminus 



of contemplated polypeptide. Particularly contemplated fusion proteins include fusions w-jth en- 
zymatically active fusion partners (e.g., for dye formation or substrate conversion) and fluores- 
cent fusion partners such as GFP, EGFB, BFP, etc. Therefore, sequences other than the sequen- 
ces of SEQ ID NO: 8 - SEQ ID NO: 14 are also contemplated, so long as the polypeptides are 
5 intracellular proteins, and alternative peptide sequences may have a sequence that has a 70%, 
preferably 80%, more preferably 90%, and most preferably 95% homology to the sequences of 
SEQ ID NO:8 - SEQ ID NO: 14. 

With respect to chemical and enzymatic modifications of contemplated polypeptides, it is 
contemplated that many modifications are appropriate, including addition of mono-, and bifunc- 

10 tional linkers, coupling with protein- and non-protein macromolecules, and glycosylation. For 
example, mono- and Afunctional linkers are especially advantageous where contemplated poly- 
peptides are immobilized to a solid support, or covalently coupled to a molecule that enhances 
immunogenicity of contemplated polypeptides (e.g., KLH, or BSA conjugation). Alternatively, 
contemplated polypeptides may be coupled to antibodies or antibody fragments to allow rapid 

15 retrieval of the polypeptide from a mixture of molecules. Further contemplated couplings include 
covalent and non-covalent coupling of contemplated polypeptides with molecules that prolong 
the serum half-life and/or reduce immunogenicity such as cyclodextranes and polyethylene 
glycols. 

In a particularly contemplated aspect of the inventive subject matter, SEQ ID NO: 15 - 
20 SEQ ID NO:21 (the corresponding mRNA of SEQ ID NO: 1 - SEQ ID NO:7) are employed in a 
method of detecting a neoplastic cell in a system. In one step, a predetermined quantity of an 
RNA comprising at least one of SEQ ID NO: 15 - SEQ ID NO:21 in a cell containing system is 
correlated with the presence of a neoplastic cell, wherein the RNA encodes an intracellular 
polypeptide, and in a further step, an amount of at least the predetermined quantity of the RNA is 
25 detected in the system. 

In a preferred embodiment, the system is a mammal (most preferably a human) and the 
neoplastic cell is a prostate cancer cell in a biopsy specimen. The total RNA is extracted from the 
biopsy specimen, and a real time quantitative rt-PCR employing individual reactions with primer 
pairs specific to each of the sequences of SEQ ID NO: 15 - SEQ ED NO:21 is performed in 
30 parallel with a biopsy specimen known to be free of cancer cells. Biopsy specimens are deter- 
mined to have a cancer cell, where the detected mRNA quantity of SEQ ID NO: 15 - SEQ ID 
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NO:21 is at least 3 times higher than in the control specimen. A preferred extraction of total 
RNA utilizes the Quiagen BioRobot kit in conjunction with the BioRobot 9600 system, and the 
real time rtPCR is performed in a Perkin Elmer ABI Prism 7700. 

In alternative aspects of the inventive subject matter, the method of detecting a neoplastic 
5 cell need not be limited to biopsy tissues from prostate tissue, but may employ various alterna- 
tive tissues, including lymphoma tumor cells, and various solid tumor cells, so long as such 
tumor cells overproduce mRNA of the SEQ ID NO: 1 5 - SEQ ID NO:21 . Appropriate alternative 
tumor cells can readily be identified by the above described method. Likewise, the system need 
not be restricted to a mammal, but may also include cell-, and tissue cultures grown in vitro, and 
10 tumor cells and specimens from animals other than mammals. 

For example, tumor cell and tissue grown in vitro may advantageously be utilized to 
investigate drug action on such cells, and the overabundance of sequences of SEQ ID NO: 15 • 
SEQ ED NO:21 may conveniently be employed as tumor marker. Alternatively, body fluids {e.g., 
serum, saliva, etc.) that may or may not contain tumor cells are also contemplated a suitable 
15 substrate for the method presented herein, so long as they contain to at least some extent mRNA 
with a sequence of SEQ ID NO:15 - SEQ ID NO:21 . 

With respect to the detection method it is contemplated that many methods other than 
quantitative real time rt-PCR are also appropriate, and particularly contemplated methods in- 
clude hybridization of a probe to at least one of SEQ ID NO:15 - SEQ ID NO:21. It is especially 

20 contemplated that suitable probes are labeled, and depending on the physico-chemical nature of 
the probe, the detection process may include fluorescence detection, luminescence detection, 
scintigraphy, autoradiography, and formation of a dye. For example, for microscopic analysis of 
biopsy specimens, fluorescein modified sequence probes (complementary to at least one of SEQ 
ID NO:15 - SEQ ID NO:21) are particularly advantageous. Fluorescence quantification may then 

25 be performed utilizing a CCD-video, analysis package. Similarly, luminescence may be detected 
with a luminometer coupled to a microscope, or where tissue pieces are submerged in a sample 
cuvette, luminescence may be determined in the sample fluid. It should be appreciated that 
labeling of oligonucleotides and hybridization of the labeled oligonucleotide is a technique that 
is well known in the art, and that all known methods are generally suitable for use in conjunction 

30 with methods contemplated herein. Alternatively, the amount of mRNA may also be determined 
by first hybridizing a probe to the mRNA and subsequently enzymatically coupling of at least 
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one nucleotide to the probe, and especially contemplated enzymatic additions include LCR and 
PCR. 

In still other aspects of contemplated methods, the mRNA quantity need not necessarily 
be limited to at least 3 times more than in the control specimen in order to establish that the 
5 tissue has a cancer cell. For example, where the concentration of mRNA is hormone dependent, 
higher amounts between 3-8 fold and more may be appropriate. In contrast, where the concen- 
tration of cancer cells in the biopsy specimen is relatively low, amounts of less than 3-fold, in- 
cluding 1.5 to 2.9-fold and less are contemplated. 

In another particularly contemplated aspect of the inventive subject matter, polypeptide 
10 of SEQ ID NO: 8 - SEQ ID NO:14 (encoded by SEQ ID NO: 1 - SEQ ID NO:7) are employed 
in a method of detecting a neoplastic cell in a system. In one step, a predetermined quantity of an 
intracellular polypeptide comprising at least one of SEQ ID NO: 8 - SEQ ID NO: 14 in a cell 
containing system is correlated with the presence of a neoplastic cell, and in a further step, an 
amount of at least the predetermined quantity of the RNA is then detected in the system. 

15 In a preferred embodiment, the system is a mammal (most preferably a human) and the 

neoplastic cell is a prostate cancer cell or a breast cancer cell in a biopsy specimen. The biopsy 
specimen that is suspected to have a cancer cell is flash frozen, dissected on a microtome, and 
sections are mounted on microscope slides. The sections are subsequently incubated with a 
fluorescein labeled antibody that is directed against an epitope of at least one of the polypeptides 

20 of SEQ ID NO:8 - SEQ ID NO: 14. Fluorescence is detected with a fluorescence microscope 
coupled to a CCD-video camera and image analysis equipment. Biopsy specimens are deter- 
mined to have a cancer cell, where the fluorescence signal/quantity of one or more cells is at 
least 3 times higher than in the control specimen. 

In alternative aspects of the inventive subject matter, the method of detecting a neoplastic 
25 cell need not be limited to biopsy tissues from prostate tissue, but may employ various alterna- 
tive tissues, including lymphoma tumor cells, and various solid tumor cells, so long as such 
tumor cells overproduce polypeptides of the SEQ ID NO:8 - SEQ ED NO: 14. Appropriate alter- 
native tumor cells can readily be identified by the above described method. Likewise, the system 
need not be restricted to a mammal, but may also include cell, and tissue cultures grown in vitro, 
30 and tumor cells and specimens from animals other than mammals. For example, tumor cell and 
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tissue grown in vitro may advantageously be utilized to investigate drug action on such cells, and 
the polypeptides of SEQ ID NO:8 - SEQ ID NO: 14 may conveniently be employed as a tumor 
marker. Alternatively, body fluids {e.g., serum, saliva, etc.) that may or may not contain tumor 
cells are also contemplated as suitable substrates for the method presented herein, so long as they 
5 contain to at least some extent the polypeptides of SEQ ID NO:8 - SEQ ID NO:14. 

With respect to detection methods, it is contemplated that many methods other than 
fluorescence microscopy are also appropriate, and particularly contemplated methods include 
specific binding of a probe to at least one of SEQ ID NO:8 - SEQ ID NO: 14. It is especially 
contemplated that suitable probes are labeled, and depending on the physico-chemical nature of 
10 the probe, the detection process may include fluorescence detection, luminescence detection, 
scintigraphy, autoradiography, and formation of a dye. 

For example, for microscopic analysis of biopsy specimens, luciferase labeled probes are 
particularly advantageous in conjunction with a luminescence substrate (e.g., luciferin). Lumi- 
nescence quantification may then be performed utilizing a CCD-camera and image analysis sys- 

15 tern. Similarly, radioactivity may be detected via autoradiographic or scintigraphic procedures on 
a tissue section, in a fluid or on a solid support. Where the probe is a natural or synthetic ligand 
of contemplated polypeptides, particularly contemplated ligands include molecules with a 
chemical modification that increase the affinity to the polypeptide and/or induce irreversible 
binding to the polypeptide. For example, transition state analogs or suicide inhibitors for a 

20 particular reaction catalyzed by the polypeptide are especially contemplated. Labeling of 

antibodies, antibody fragments, small molecules, and binding of the labeled entity is a technique 
that is well known in the art, and it is contemplated that all known methods are generally suitable 
fcr use in conjunction with methods contemplated herein. Furthermore, the probe need not be 
linr ed to a fluorescein labeled antibody, and alternative probes include antibody fragments (e.g., 

25 Fab, Fab 1 , scFab, etc.). 

In still other aspects of contemplated methods, the polypeptide quantity need not neces- 
sarily be limited to at least 3 times more than the control specimen in order to establish that the 
tissue has a cancer cell {e.g., where the control reads lOOng, three times more than the control 
means 300ng). For example, where the concentration of the polypeptide is hormone dependent, 
30 higher amounts between 3-8 fold and more may be appropriate. In contrast, where the concen- 
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tration of cancer cells in the biopsy specimen is relatively low, amounts of less than 3-fold, in- 
cluding 1.5 to 2.9-fold and less are contemplated. 

It should further be appreciated that the polynucleotides of SEQ ID NO:l- SEQ ID NO:7 
and SEQ ID NO: 15 - SEQ ID NO:21 may be employed as a therapeutic modality in an antisense 
5 DNA/RNA based therapy. Anti-sense therapy, for example, could be employed to inhibit, up-, or 
down-regulate transcription or translation of the genes corresponding to SEQ ID NO:l- SEQ ED 
NO:7. It should further be appreciated that an anti-sense approach may also include regulatory 
sequences associated with SEQ ID NO:l- SEQ ID NO:7 such as transcription enhancers, hor- 
mone responsive elements, ribosomal- and RNA polymerase binding sites, etc., which may be 
10 located upstream or downstream of SEQ ID NO:l- SEQ ID NO:7, and may have a distance of 
several ten base pairs to several ten thousand base pairs. 

Alternatively, the polypeptides of SEQ ID NO:8- SEQ ID NO: 14 may also be employed 
in an antibody based therapy or a small molecule drug therapy directed towards the polypeptides 
of SEQ ID NO:8- SEQ ID NO: 14. For example, antibody based therapy could be employed to 
15 neutralize, or remove corresponding polypeptides of SEQ ID NO:8- SEQ ED NO:14 z/i-vzvo, or 
to interfere with one or more cellular functions of contemplated polypeptides. 

Figures 1A and IB show a schematic and an amino acid based alignment between the 
cDNAs of SEQ ID NO:l - SEQ ID NO:7, in which SEQ ID NO:l is the full-length cDNA and 
SEQ ID N02 - SEQ ID NO:7 are splicing variants of SEQ ID NO:l. In the amino acid based 
20 alignment SEQ ID NO:8 is the corresponding polypeptide to SEQ ID NO:l, and SEQ ID NO:9 - 
SEQ ID NO: 14 are the corresponding polypeptides to SEQ ID NO:2 - SEQ ID NO:6. 

Examples 

The following examples illustrate the isolation and cloning of the sequences of SEQ ED 
NO:l - SEQ ID NO:7 form normal prostate tissue and from prostate cancer cells. SEQ ID NO:8 - 
25 SEQ ID NO:14, and SEQ ED N015 - SEQ ID NO:21 are computer generated transcriptions and 
translations of SEQ ID NO 1 - SEQ ID NO:7, respectively. 

The following examples also illustrate a general method of identifying differentially ex- 
pressed genes in a target tissue, in which in one step a target tissue-specific cDNA library is 
provided that has a plurality of tissue-specific genes obtained by suppression subtractive 



t 
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hybridization. In a subsequent step, a predetermined quantity of tissue-specific genes is 
immobilized on a solid phase to form a tissue-specific cDNA array, and a first nucleic acid 
preparation is hybridized to a first tissue-specific cDNA array to create a first hybridization 
pattern, wherein the first preparation is prepared from the target tissue without previously 
5 exposing the target tissue to a compound. In a further step, a second nucleic acid preparation is 
hybridized to a second tissue-specific cDNA array to create a second hybridization pattern, 
wherein the second preparation is prepared from the target tissue after previously exposing the 
target tissue to a compound. In yet a further step, the first and the second hybridization pattern 
are then compared to identify differentially expressed genes. This general method is especially 
10 contemplated where the compound comprises a hormone, or various other suitable ligands. 

Su ppression Subtraction of Prostate Specific Genes 

cDNA derived from poly(A)+ RNA of 10 different normal human tissues were subtrac- 
ted against normal human prostate cDNA using suppression subtraction hybridization (SSH) 

15 (Diatchenko, L., Lau, Y.-F- C, Campbell, A.P., Chenchik, A., Moqadam, F., Huang, B., Luky- 
anov, S., Lukyanov, K., Gurskaya, N., Sverdlov, E.D., Siebert, P.D. (1996). Proc. Natl Acad. 
Sci. USA 93, 6025-6030), and the resulting cDNA fragments were cloned into an appropriate 
vector. SSH was performed as described (Clontech PCR-Select Cloning Kit) using prostate 
p: >y(A)+ RNA against a pool of poly(A)+ RNA obtained from ten normal human tissues (heart, 

20 brain, placenta, lung, liver, skeletal muscle, kidney, spleen, thymus, and ovary). Upon secondary 
PCR amplification (12 cycles), reactions were extracted with phenol/chloroform and DNA was 
precipitated with EtOH. 

The pellet was washed once with 70% EtOH. After drying, the DNA pellet was dissolved 
in 0.2xTE or dH 2 0 and cut with Rsal in a 20 ul reaction for 2 hrs at 37C to excise adaptors. After 
25 digestion, reactions were run on a 1.5 % agarose gel, with molecular size markers on one side, at 
5 V/cm, 40 min. The adopter bands are excised and discarded, and cDNA bands were cut out and 
purified (QAIEX gel DNA purification kit) after running the gel backwards to concentrate the 
cDNA. 

The purified DNA was subcloned into EcoRV-cut, dephosphorylated pZERO vector 
30 from Invitrogen. DH10B electrocompetent cells (>10 10 efficiency) were transformed with a 1/5 
dilution of 1 fj.1 of the ligation mix. 
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Colonies were picked and the presence of cDNA inserts confirmed by PCR with T7 and 
SP6 primers directly from the colonies. 10% of reactions were run on a 1.5% agarose gel to 
visualize amplified products. The colonies with inserts were grown and glycerol stocks (15%) 
were prepared and stored at — 80C. 

Reverse Northern Blot and Sequence Analyses 

Clones from the SSH library were amplified by PCR and spotted on nylon filters in 96- 
well format to generate two identical blots for each set of 92 clones (the remaining four spots 
were used for positive and negative controls). For probe preparation, the androgen-responsive 
prostate cancer cell line LNCaP (Horoszewicz, J.S., Leong, S.S., Kawinski, E., Kair, J.P., 
Rosenthal, H., Chu, T.M., Mirand, E.A., and Murphy, G.P. (1983). Cancer Res. 43, 1809-1818) 
was employed that was either untreated [the (-) probe] or treated with the synthetic androgen 
R1881 for 24 hours [the (+) probe]. Poly(A)+ RNA was isolated from these cells and was used 
to make the 32 P-labeled probes. After hybridization with the (-) and (+) probes, clones showing 
differential hybridization were selected for further analysis (z.e., confirmation by a secondary 
reverse northern blot, and northern blotting). 

Reverse northern screening on the cDNA clones were done essentially as described else- 
where (Hedrick, S.M., Cohen, D.I., Neilson, E.A., Davis, M.M. (1984) Nature 308, 149-153; 
Sakaguchi N, Berger CN, Melchers F (1986). EMBO J 5: 2139-2147). DNA (approximately 400 
ng) from PCR amplification in step 6 was diluted in 200 nl of 0.4M NaOH, 10 mM EDTA and 
mixed well by pipetting. After incubation at 95°C for 5-10 min, the tubes were chilled on ice. 
Denatured DNA was blotted on two separate pieces of Zeta Probe GT+ membrane (Bio-Rad) 
using a dot-blot apparatus (Bio-Rad). Positive [Prostate specific antigen (PSA) cDNA] and 
negative [glyceraldehyde 3-phosphate dehydrogenase (G3PDH) cDNA] controls are included on 
each blot in duplicate. Membranes were rinsed with 2XSSC, air dried, and then baked at 80°C 
for 30 min. A typical example of a reverse northern analysis is shown in Figure 2. In each blot 
pair, PSA and G3PDH are included as positive and negative controls, respectively. It should be 
noted that there was substantial increase in PSA hybridization in the (+) blot (probe prepared 
from cells that have been stimulated by androgens) compared with the (-) blot (probe prepared 
from untreated cells), whereas there was no significant change in hybridization of G3PDH 
between the two blots. Arrowheads indicate differentially expressed clones. 
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As a control to verify the prostate specific nature of isolated sequences, positive clones 
were tested in a standard northern blot against RNA preparations of multiple non-prostate tissues 
and a typical blot is shown in Figure 3. Lanes 1-10, and 12-16 are RNA preparations from non- 
prostate tissues, lane 1 1 is a RNA preparation from prostate, lane 12 is a RNA preparation from 
5 testis. 

Probes: The probes were random-prime radiolabeled using standard laboratory proce- 
dures. Unincorporated nucleotides were removed using prespun G25 columns (BiorRad), and 
specific activity was typically over 5xl0 8 cpm/jig. 

Hybridization: 25 ml Hybridization mix (7% SDS, 0.5 M NaHP0 4 , ImM EDTA) at 65°C 
10 is prewarmed, and 12.5 ml were used for prehybridization of each membrane for 5-10 min at 65 
°C. Probes were heat denatured at 95 °C for 3-5 min and transferred to the prehybridization mix 
at 65°C. Hybridization was done at 65°C overnight. 

Washing: Wash solution I (2xSSC and 1% SDS) and wash solution II (O.lxSSC and 
0.5% SDS) were prewarmed. Membranes were washed once with Solution I and then Solution II 
15 for 30 min at 65°C, covered with plastic wrap and exposed to phoshorimager screen. 

Selection: Clones showing differential expression between the (-) and (+) blots were 
picked. A secondary round of reverse northern analysis is performed for confirmation by 
spotting each clone in duplicate on each blot. To confirm hormone dependence, a time course of 
R1881 induction of LNCaP cells as well as the CWR22 xenograft model upon androgen ablation 

20 (Wainstein, M. A., He, F., Robinson, D., Kung, H. J., Schwartz, S., Giaconia, J. M., Edgehouse, 
N. L., Pretlow, T. P., Bodner, D. R., Kursh, E. D., and Pretlow, T.G. (1994). Cancer Res. 54, 
6049-6052) and the androgen-independent CWR22R relapsed xenograft (Nagabhushan, M., 
Miller, C. M., Pretlow, T. P., Giaconia, J. M., Edgehouse, N. L., Schwartz, S., Kung, H. J., de 
Vere White, R. W., Gumerlock, P. H., Resnick, M. L, Amini, S. B. , and Pretlow, T. G. (1996). 

25 Cancer Res. 56, 3042-6) was used. 

Sequence analysis: Sequence analysis was performed by the dideoxy chain termination 
methods using an ABI automated sequencer. It should be appreciated that many more androgen 
responsive, differentially expressed genes can be identified and isolated using the cloning stra- 
tegy outlined above, including genes expressed during various growth and developmental phases 
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of a diseased prostate, and genes expressed as a result of a drug regimen. Moreover, it is contem- 
plated that not only differentially expressed prostate cancer genes can be identified and isolated, 
but also genes involved in other diseased states of human prostate, including benign prostate 
hyperplasia, etc. 

Isolation of Splice Variants (SEP ID NO: 2 - SEP ID NO: 7) 

Poly(A)* RNA was isolated from LNCaP cells treated with R1881 (a synthetic androgen) 
and from androgen dependent prostate cancer xenograft CWR22 grown in nude mice in the 
presence of androgens. cDNA was prepared and subjected to PCR using SEQ ID NO:l specific 
primers and a primer pair designed to amplify the previously published prostase. The respective 
S'-primers were located around the translation start site, while the 3'-primer for all reactions was 
located around the stop codon. Reaction products were loaded onto an agarose gel and separated 
as shown in Figure 4. Lane 1 is a marker, lane 2 is positive control with SEQ ID NO:l as 
template. Lanes 3-5 are PCR products from CWR22 cells with SEQ ID NO:l specific primers, 
while lanes 6-8 are PCR products from CWR22 cells with prostase specific 5-primer. Lanes 9- 
1 1 are PCR products from LNCaP cells with SEQ ID NO:l specific primers, and lanes 12-14 are 
PCR products from LNCaP cells with prostase specific primers. Lane 15 is marker. 

Only reactions with SEQ ID NO:l specific primers yielded detectable PCR products, 
with a major band at 680bp (SEQ ID NO:l), and two additional bands at about 500bp (SEQ ID 
NO:2) and 750bp (SEQ ID NO:3). When primers for 5'-RACE analysis were employed, four 
additional PCR products were obtained, corresponding to SEQ ID N0:4,* SEQ ID NO:5, and 
SEQ ID NO:6, respectively. All bands were sequenced to confirm their identity. SEQ ID NO:7 
was obtained both as a 3'-RACE product as well as a distinct clone from the PSL library. 

Localization of Intracellular Proteins encoded by SEP ID NO: 1 - SEP ID NO: 3 

To gain insight into the subcellular location of the polypeptides SEQ ID NO:8 - SEQ ID 
NO: 10 (encoded by SEQ ID NO:l -'SEQ ID NO:3), C-terminal fusion constructs with GFP were 
produced in COS 7 cells. The cells were fixed, stained with DAPI and visualized by phase con- 
trast or fluorescence microscopy and representative images are shown in Figure 5. Photographs 
in lane A depict a GFP fusion protein with SEQ ID NO:9, the photographs in lane B depict a 
GFP fusion protein with SEQ ID NO: 10, the photographs in lane C depict a GFP fusion protein 
with SEQ ID NO:8, and the photographs in lane D depict a GFP protein as a control. The poly- 
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peptide of SEQ ID NO: 8 displayed strong granular fluorescence predominantly around the 
nucleus, while the polypeptide of SEQ ID NO:9 and SEQ ID NO: 10 showed exclusively nuclear 
and predominantly nuclear localization, respectively. Due to the lack of an identifiable leader 
sequence that would indicate an export of the polypeptides of SEQ ID NO:l 1 - SEQ ID NO: 14, 
it is contemplated that the sequences SEQ ID NO:l 1 - SEQ ID NO: 14 are also intracellular 
proteins. 

Regulation of Expression by multiple Hormones 

Untreated LNCaP cells and hormone treated LNCaP cells were employed to determine 
the hormone dependence of expression of SEQ ID NO:l . Treatment was as follows: Testoster- 
one (T) at 10~ 8 M, dihydrotestosterone (DHT) at 10* 8 M, estradiol (E2) at lO^M, progesterone (P) 
at lO^M, dexamethasone (Dex) at 10" 7 M, 1, 25-dihydroxy-vitamin D3 (VitD3) at 10" 8 M, and 
triiodothyronine (T3) at 10* 7 M. The total RNA of the treated cells was isolated and used in a 
northern blot analysis with radiolabeled SEQ ID NO:l as probe. Figure 6 shows the results of an 
autoradiograph of a northern blot as described above. 1 8S-RNA is shown as control for RNA 
integrity and loading. The relative induction of SEQ ID NO:l is indicated at the bottom of the 
lanes as determined by phosphorimager analysis. It is contemplated that SEQ ID NO:2 - SEQ ID 
NO:7 are splice variants of SEQ ID NO:l, and consequently it is contemplated that the express- 
ion of all of SEQ ID NO:l - SEQ ID NO:6 is hormone dependent, and particularly contemplated 
hormones include androgens, progesterones, estrogens and glucocorticoids. 

Thus, specific embodiments and applications of methods and applications of differen- 
tially expressed genes in prostate cancer cells have been disclosed. It should be apparent, 
however, to those skilled in the art that many more modifications besides those already described 
are possible without departing from the inventive concepts herein. The inventive subject matter, 
therefore, is not to be restricted except in the spirit of the appended claims. Moreover, in 
interpreting both the specification and the claims, all terms should be interpreted in the broadest 
possible manner consistent with the context. In particular, the terms "comprises" and 
"comprising" should be interpreted as referring to elements, components, or steps in a non- 
exclusive manner, indicating that the referenced elements, components, or steps may be present, 
or utilized, or combined with other elements, components, or steps that are not expressly 
referenced. 
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SEQUENCE LISTING 

SEQ ID NO : 1 

atggaaaacg aattgttctg ctcgggcgtc ctggtgcatc cgcagtgggt gctgtcagcc 60 
gcacactgtt tccagaactc ctacaccatc gggctgggcc tgcacagtct tgaggccgac 120 
caagagccag ggagccagat ggtggaggcc agcctctccg tacggcaccc agagtacaac 180 
agacccttgc tcgctaacga cctcatgctc atcaagttgg acgaatccgt gtccgagtct 24 0 
gacaccatcc ggagcatcag cattgcttcg cagtgcccta ccgcggggaa ctcttgcctc 300 
gtttctggct ggggtctgct ggcgaacggc agaatgccta ccgtgctgca gtgcgtgaac 360 
gtgtcggtgg tgtctgagga ggtctgcagt aagctctatg acccgctgta ccaccccagc 420 
atgttctgcg ccggcggagg gcaagaccag aaggactcct gcaacggtga ctctgggggg 480 
cccctgatct gcaacgggta cttgcagggc cttgtgtctt tcggaaaagc cccgtgtggc 54 0 
caagttggcg tgccaggtgt ctacaccaac ctctgcaaat tcactgagtg gatagagaaa 600 
accgtccagg ccagttaa 618 

SEQ ID NO: 2 

atggaaaacg aattgttctg ctcgggcgtc ctggtgcatc cgcagtgggt gctgtcagcc 60 
gcacactgtt tccagaactc ctacaccatc gggctgggcc tgcacagtct tgaggccgac 12 0 
caagagccag ggagccagat ggtggaggcc agcctctccg tacggcaccc agagtacaac 180 
agacccttgc tcgctaacga cctcatgctc atcaagttgg acgaatccgt gtccgagtct 24 0 
gacaccatcc ggagcatcag cattgcttcg cagtgcccta ccgcggggaa ctcttgcctc 3 00 
gtttctggct ggggtctgct ggcgaacggg tgactctggg gggcccctga tctgcaacgg 360 
gtacttgcag ggccttgtgt ctttcggaaa agccccgtgt ggccaagttg gcgtgccagg 420 
tgtctacacc aacctctgca aattcactga gtggatagag aaaaccgtcc aggccagtta 4 80 

481 

SEQ ID NO: 3 

atggaaaacg aattgttctg ctcgggcgtc ctggtgcatc cgcagtgggt gctgtcagcc 60 
gcacactgtt tccagaactc ctacaccatc gggctgggcc tgcacagtct tgaggccgac 12 0 
caagagccag ggagccagat ggtggaggcc agcctctccg tacggcaccc agagtacaac 180 
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agacccttgc 


tcgctaacga 


cctcatgctc 


atcaagttgg 


acgaatccgt 


gtccgagtct 


240 


gacaccatcc 


ggagcatcag 


cattgcttcg 


cagtgcccta 


ccgcggggaa 


ctcttgcctc 


300 


gtttctggct 


ggggtctgct 


ggcgaacggt 


gagctcacgg 


gtgtgtgtct 


gccctcttca 


360 


aggaggtcct 


ctgcccagtc 


gcgggggctg 


acccagagct 


ctgcgtccca 


ggcagaatgc 


420 


ctaccgtgct 


gcagtgcgtg 


aacgtgtcgg 


tggtgtctga 


ggaggtctgc 


agtaagctct 


480 


atgacccgct 


gtaccacccc 


agcatgttct 


gcgccggcgg 


agggcaagac 


cagaaggact 


540 


cctgcaacgg 


tgactctggg 


ggggccctga 


tctgcaacgg 


gtacttgcag 


ggccttgcgt 


600 


ctttcggaaa 


agccccgtgt 


tggccaagtt 


ggcgtgccag 


gtgtctacac 


caacctctgc 


660 


aaattcactg 


agtggataga 


gaaaaccgtc 


caggccagtt 


aa 




702 


SEQ ID NO: 4 












ggaatgagcc 


tggatccggg 


gagcccagag 


ggaagggctg 


ggaggcggga 


atcttgcttc 


60 


ggaaggactc 


agagagtcct 


gacttgaaat 


ctcagcccag 


tgctgagtct 


ctagtgaact 


12 0 


aagctcctac 


accatcgggc 


tgggcctgca 


cagtcttgag 


gccgaccaag 


agccagggag 


180 


ccagatggtg 


gaggccagcc 


tctccgtacg 


gcacccagag 


tacaacagac 


ccttgctcgc 


240 


taacgacctc 


atgctcatca 


agttggacga 


atccgtgtcc 


gagtctgaca 


ccatccggag 


300 


catcagcatt 


gcttcgcagt 


gccctaccgc 


ggggaactct 


tgcctcgttt 


ctggctgggg 


360 


tctgctggcg 


aacggtgaac 


tcacgggtgt 


gtgtctgccc 


tcttcaagga 


ggtcctctgc 


420 


ccagtcgcgg 


gggctgaccc 


agagctctgc 


gtcccaggca 


gaatgcctac 


cgtgctgcag 


480 


tgcgtgaacg 


tgtcggtggt 


gtctgaggag 


gtctgcagta 


agctctatga 


cccgctgtac 


540 


caccccagca 


tgttctgcgc 


cggcggaggg 


caagaccaga 


aggactcctg 


caacggtgac 


600 


tctggggggc 


ccctgatctg 


caacgggtac 


ttgcagggcc 


ttgtgtcttt 


cggaaaagcc 


660 


ccgtgtggcc 


aagttggcgt 


gccaggtgtc 


tacaccaacc 


tctgcaaatt 


cactgagtgg 


720 


atagagaaaa 


ccgtccaggc 


cagttaactc 


tggggactgg 


gaacccatga 


aattgacccc 


780 


caaatacatc 


ctgcggaagg 


aattcaggaa 


tatctgatcc 




f- r* 


0*1/1 


SEQ ID NO: 5 












ggaatgagcc 


tggatccggg 


gagcccagag ggaagggctg 


ggaggcggga 


atcttgcttc 


60 


ggaaggactc 


agagagccct 


gacttgaaat 


ctcagcccag 


tgctgagtct 


ctagtgaact 


120 



-19- 



aagctcctac accatcgggc tgggcctgca 
ccagatggtg gaggccagcc tctccgtacg 
taacgacctc atgctcatca agttggacga 
catcagcatt gcttcgcagt gccctaccgc 
tctgctggcg aacggcagaa tgcctaccgt 
tgaggaggtc tgcagtaagc 

SEQ ID NO: 6 

ggctctggga ggaggacgga atgagcctgg 
ggcgggaatc ttgcttcgga aggactcaga 
tgagtctcta gtgaactaag ctcctacacc 
gaccaagagc cagggagcca gatggtggag 
aacagaccct tgctcgctaa cgacctcatg 
tctgacacca tccggagcat cagcattgct 
ctcgtttctg gctggggtct gctggcgaac 
aacgtgtcgg tggtgtctga ggaggtctgc 

SEQ ID NO: 7 

accaccccag catgttctgc gccggcggag 
gagaggggaa aggggagggc aggcgactca 
acacagggcc gcatggcgag atgcagagat 
agagagagaa actgagagaa acagggaaat 
agagaaacag aaacagacat gggggaggca 
cttccaacag catggggcct gagggcggtg 
cttttgactc cccaaaaaac ctgacta^aa 
aataacataa atagtcgatt tatgcatacg 
aattttttga tatttctaag ctacacagtt 
tctcctaaaa ttttttctaa tgtgtttatt 
cagttcaaac cagggttgtt caagggtcaa 



cagtcttgag gccgaccaag agccagggag 180 
gcacccagag tacaacagac ccttgctcgc 24 0 
atccgtgtcc gagtctgaca ccatccggag 300 
ggggaactct tgcctcgttt ctggctgggg 360 
gctgcagtgc gtgaacgtgt cggtggtgtc 420 

440 

atccggggag cccagaggga agggctggga 60 
gagccctgac ttgaaatctc agcccagtgc 120 
atcgggctgg gcctgcacag tcttgaggcc 180 
gccagcctct ccgtacggca cccagagtac 240 
ctcatcaagt tggacgaatc cgtgtccgag 300 
tcgcagtgcc ctaccgcggg gaactcttgc 360 
ggcagaatgc ctaccgtgct gcagtgcgtg 420 
agtaagc 457 

agcaagacca gaaggactcc tgcaacgtga 60 
gggaagggtg gagaaggggg agacagagac 120 
ggagagacac acagggagac agtgacaact 180 
aaacacagga ataaagagaa gcaaaggaag 240 
gaaacacaca cacatagaaa tgcagctgac 300 
acctccaccc aacagaaaat cctcttataa 360 
atagcctact gttgacgggg gagccttacc 420 
ttttatgcat tcatgatata cctttgttgg 4 80 
cgtctgtgaa tttttttaaa ttgttgcaac 540 
gaaaaaaatc caagtataag tggacttgtg 600 
ctgtgt 636 
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SEQ ID NO: 8 

auggaaaacg aauuguucug cucgggcguc 
gcacacuguu uccagaacuc cuacaccauc 
caagagccag ggagccagau gguggaggcc 
agacccuugc ucgcuaacga ccucaugcuc 
gacaccaucc ggagcaucag cauugcuucg 
guuucuggcu ggggucugcu ggcgaacggc 
gugucggugg ugucugagga ggucugcagu 
auguucugcg ccggcggagg gcaagaccag 
ccccugaucu gcaacgggua cuugcagggc 
caaguuggcg ugccaggugu cuacaccaac 
accguccagg ccaguuaa 

SEQ ID NO: 9 

auggaaaacg aauuguucug cucgggcguc 
gcacacuguu uccagaacuc cuacaccauc 
caagagccag ggagccagau gguggaggcc 
agacccuugc ucgcuaacga ccucaugcuc 
gacaccaucc ggagcaucag cauugcuucg 
guuucuggcu ggggucugcu ggcgaacggg 
guacuugcag ggccuugguc uuucggaaaa 
gucuacacca accucugcaa auucacugag 

SEQ ID NO: 10 

auggaaaacg aauuguucug cucgggcguc 
gcacacuguu uccagaacuc cuacaccauc 
caagagccag ggagccagau gguggaggcc 
agacccuugc ucgcuaacga ccucagcuca 
acaccauccg gagcaucagc auugcuucgc 



cuggugcauc cgcagugggu gcugucagcc 60 
gggcugggcc ugcacagucu ugaggccgac 120 
agccucuccg uacggcaccc agaguacaac 180 
aucaaguugg acgaauccgu guccgagucu 24 0 
cagugcccua ccgcggggaa cucuugccuc 300 
agaaugccua ccgugcugca gugcgugaac 360 
aagcucuaug acccgcugua ccaccccagc 420 
aaggacuccu gcaacgguga cucugggggg 4 80 
cuugugucuu ucggaaaagc cccguguggc 540 
cucugcaaau ucacugagug gauagagaaa 60 0 

618 

cuggugcauc cgcagugggu gcugucagcc 60 
gggcugggcc ugcacagucu ugaggccgac 120 
agccucuccg uacggcaccc agaguacaac 180 
aucaaguugg acgaauccgu guccgagucu 24 0 
cagugcccua ccgcggggaa cucuugccuc 300 
ugacucuggg gggccccuga ucugcaacgg 360 
gccccgugug gccaaguugg cgugccaggu 420 
uggauagaga aaaccgucca ggccaguuaa 480 

cuggugcauc cgcagugggu gcugucagcc 60 
gggcugggcc ugcacagucu ugaggccgac 120 
agccucuccg uacggcaccc agaguacaac 180 
ucaaguugga cgaauccgug uccgagucug 240 
agugcccuac cgcggggaac ucuugccucg 3 00 



uuucuggcug gggucugcug gcgaacggug 
ggagguccuc ugcccagucg cgggggcuga 
uaccgugcug cagugcguga acgugucggu 
ugacccgcug uaccacccca gcauguucug 
cugcaacggu gacucugggg gggcccugau 
uuucggaaaa gccccguguu ggccaaguug 
aauucacuga guggauagag aaaaccgucc 

SEQ ID NO: 11 

ggaaugagcc uggauccggg gagcccagag 
ggaaggacuc agagaguccg acuugaaauc 
agcuccuaca ccaucgggcu gggccugcac 
cagaggugga ggccagccuc uccguacggc 
acgaccucau gcucaucaag uuggacgaau 
ucagcauugc uucgcagugc ccuaccgcgg 
ugcuggcgaa cgggaacuca cgggugugug 
gucgcggggg cugacccaga gcucugcguc 
ugaacguguc gguggugucu gaggaggucu 
ccagcauguu cugcgccggc ggagggcaag 
gggggccccu gaucugcaac ggguacuugc 
guggccaagu uggcgugcca ggugucuaca 
agaaaaccgu ccaggccagu uaacucuggg 
uacauccugc ggaaggaauu caggaauauc 

SEQ ID NO: 12 

ggaaugagcc uggauccggg gagcccagag 
gaaggacuca gagagcccug acuugaaauc 
agcuccuaca ccacgggcug ggccugcaca 
agauggugga ggccagccuc uccguacggc 
acgaccucau gcucaucaag uuggacgaau 
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agcucacggg ugugugucug cccucuucaa 3 60 
cccagagcuc ugcgucccag gcagaaugcc 42 0 
ggugucugag gaggucugca guaagcucua 4 80 
cgccggcgga gggcaagacc agaaggacuc 54 0 
cugcaacggg uacuugcagg gccuuguguc 600 
gcgugccagg ugucuacacc aaccucugca 660 
aggccaguua a 701 

ggaagggcug ggaggcggga aucuugcuuc 60 
ucagcccagu gcugagucuc uagugaacua 12 0 
agucuugagg ccgaccaaga gccagggagc 180 
acccagagua caacagaccc uugcucgcua 24 0 
ccguguccga gucugacacc auccggagca 3 00 
ggaacucuug ccucguuucu ggcugggguc 360 
ucugcccucu ucaaggaggu ccucugccca 42 0 
ccaggcagaa gccuaccgug cugcagugcg 480 
gcaguaagcu cuaugacccg cuguaccacc 54 0 
accagaagga cuccugcaac ggugacucug 60 0 
agggccuugu gucuuucgga aaagccccgu 660 
ccaaccucug caaauucacu gaguggauag 720 
gacugggaac ccaugaaauu gacccccaaa 780 
ugaucccagc cccuccuccc 830 

ggaagggcgg gaggcgggaa ucuugcuucg 60 
ucagcccagu gcugagucuc uagugaacua 120 
gucuugaggc cgaccaagag ccagggagcc 180 
acccagagua caacagaccc uugcucgcua 24 0 
ccguguccga gucugacacc auccggagca 3 00 



ucagcauugc uucgcagugc ccuaccgcgg 
ugcuggcgaa cggcagaaug ccuaccgugc 
aggaggucug caguaagc 

SEQ ID NO: 13 

gcucugggag gaggacggaa ugagccugga 
gcgggaaucu ugcuucggaa ggacucagag 
gagucucuag ugaacuaagc uccuacacca 
accaagagcc agggagccag augguggagg 
acagacccuu gcucgcuaac gaccucaugc 
ugacaccauc cggagcauca gcauugcuuc 
cguuucuggc uggggucugc uggcgaacgg 
cgugucggug gugucugagg aggucugcag 

ID NO: 14 

accaccccag cauguucugc gccggcggag 
gagaggggaa aggggagggc aggcgacuca 
acacagggcc gcauggcgag augcagagau 
agagagagaa acugagagaa acagggaaau 
agagaaacag aaacagacau gggggaggca 
cuuccaacag cauggggccu gagggcggug 
cuuuugacuc cccaaaaaac cugacuagaa 
aauaacauaa auagucgauu uaugcauacg 
aauuuuuuga uauuucuaag cuacacaguu 
ucuccuaaaa uuuuuucuaa uguguuuauu 
aguucaaacc aggguuguuc aagggucaac 
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ggaacucuug ccucguuucu ggcugggguc 360 
ugcagugcgu gaacgugucg guggugucug 42 0 

438 

uccggggagc ccagagggaa gggcugggag 60 
agcccugacu ugaaaucuca gcccagugcu 120 
ucgggcuggg ccugcacagu cuugaggccg 180 
ccagccucuc cguacggcac ccagaguaca 240 
caucaaguug gacgaauccg uguccgaguc 3 00 
gcagugcccu accgcgggga acucuugccu 360 
cagaaugccu accgugcugc agugcgugaa 42 0 
uaagc 455 

agcaagacca gaaggacucc ugcaacguga 60 
gggaagggug gagaaggggg agacagagac 120 
ggagagacac acagggagac ac.ugacaacu 180 
aaacacagga auaaagagaa gcaaaggaag 240 
gaaacacaca cacauagaaa ugcagcugac 300 
accuccaccc aacagaaaau ccucuuauaa 360 
auagccuacu guugacgggg gagccuuacc 420 
uuuuaugcau ucaugauaua ccuuuguugg 4 80 
cgucugugaa uuuuuuuaaa uuguugcaac 54 0 
gaaaaaaauc caaguaaagu ggacuugugc 600 
ugugu 635 



SEQ ID NO: 15 

Met Glu Asn Glu Leu Phe Cys Ser Gly Val Leu Val His Pro Gin Trp 
Val Leu Ser Ala Ala His Cys Phe Gin Asn Ser Tyr Thr lie Gly Leu 



Gly Leu His Ser Leu Glu Ala Asp 
Glu Ala Ser Leu Ser Val Arg His 
Ala Asn Asp Leu Met Leu lie Lys 
Asp Thr lie Arg Ser He Ser He 
Asn Ser Cys Leu Val Ser Gly Trp 
Pro Thr Val Leu Gin Cys Val Asn 
Cys Ser Lys Leu Tyr Asp Pro Leu 
Gly Gly Gly Gin Asp Gin Lys Asp 
Pro Leu He Cys Asn Gly Tyr Leu 
Ala Pro Cys Gly Gin Val Gly Val 
Lys Phe Thr Glu Trp He Glu Lys 
SEQ ID NO: 16 

Met Glu Asn Glu Leu Phe Cys Ser 
Val Leu Ser Ala Ala His Cys Phe 
Gly Leu His Ser Leu Glu Ala Asp 
Glu Ala Ser Leu Ser Val Arg His 
Ala Asn Asp Leu Met Leu He Lys 
Asp Thr He Arg Ser He Ser He 
. Asn Ser Cys Leu Val Ser Gly Trp 

SEQ ID NO: 17 

Met Glu Asn Glu Leu Phe Cys Ser 
Val Leu Ser Ala Ala His Cys Phe 
Gly Leu His Ser Leu Glu Ala Asp 
Glu Ala Ser Leu Ser Val Arg Hi? 
Ala Asn Asp Leu Met Leu He Lys 
Asp Thr lie Arg Ser He Ser He 
Asn Ser Cys Leu Val Ser Gly Trp 
Thr Gly Val Cys Leu Pro Ser Ser 
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Gln Glu Pro Gly Ser Gin Met Val 
Pro Glu Tyr Asn Arg Pro Leu Leu 
Leu Asp Glu Ser Val Ser Glu Ser 
Ala Ser Gin Cys Pro Thr Ala Gly 
Gly Leu Leu Ala Asn Gly Arg Met 
Val Ser Val Val Ser Glu Glu Val 
Tyr His Pro Ser Met Phe Cys Ala 
Ser Cys Asn Gly Asp Ser Gly Gly 
Gin Gly Leu Val Ser Phe Gly Lys 
Pro Gly Val Tyr Thr Asn Leu Cys 
Thr Val Gin Ala Ser 

Gly Val Leu Val His Pro Gin Trp 
Gin Asn Ser Tyr Thr He Gly Leu 
Gin Glu Pro Gly Ser Gin Met Val 
Pro Glu Tyr Asn Arg Pro Leu Leu 
Leu Asp Glu Ser Val Ser Glu Ser 
Ala Ser Gin Cys Pro Thr Ala Gly 
Gly Leu Leu Ala Asn Gly 

Gly Val Leu Val His Pro Gin Trp 
Gin Asn Ser Tyr Thr He Gly Leu 
Gin Glu Pro Gly Ser Gin Met Val 
Pro Glu Tyr Asn Arg Pro Leu Leu 
Leu Asp Glu Ser Val Ser Glu Ser 
Ala Ser Gin Cys Pro Thr Ala Gly 
Gly Leu Leu Ala Asn Gly Glu Leu 
Arg Arg Ser Ser Ala Gin Ser Arg 



-24- 



Gly Leu Thr Gin Ser Ser Ala Ser Gin Ala Glu Cys Leu Pro Cys Cys 
Ser Ala 



SEQ ID NO: 18 



Met 


Val Glu 


Ala 


Ser 


Leu 


Ser 


Val 


Arg 


His 


Pro 


Glu 


Tyr 


Asn 


Arg 


Pro 


Leu 


Leu Ala 


Asn 


Asp 


Leu 


Met 


Leu 


He 


Lys 


Leu 


Asp 


Glu 


Ser 


Val 


Ser 


Glu 


Ser Asp 


Thr 


He 


Arg 


Ser 


He 


Ser 


He 


Ala 


Ser 


Gin 


Cys 


Pro 


Thr 


Ala 


Gly Asn 


Ser 


Cys 


Leu 


Val 


Ser 


Gly 


Trp 


Gly 


Leu 


Leu 


Ala 


Asn 


Gly 


Glu 


Leu Thr 


Gly 


Val 


Cys 


Leu 


Pro 


Ser 


Ser 


Arg 


Arg 


Ser 


Ser 


Ala 


Gin 


Ser Arg Gly Leu 


Thr 


Gin 


Ser 


Ser 


Ala 


Ser 


Gin 


Ala 


Glu 


Cys 


Leu 


Pro 



Cys Cys Ser Ala 



SEQ ID NO: 19 



Met Val Glu Ala 


Ser 


Leu 


Ser 


Val 


Arg 


His 


Pro 


Glu 


Tyr 


Asn 


Arg 


Pro 


Leu Leu Ala Asn 


Asp 


Leu 


Met 


Leu 


He 


Lys 


Leu 


Asp 


Glu 


Ser 


Val 


Ser 


Glu Ser Asp Thr 


He 


Arg 


Ser 


He 


Ser 


He 


Ala 


Ser 


Gin 


Cys 


Pro 


Thr 


Ala Gly Asn Ser 


Cys 


Leu 


Val 


Ser 


Gly 


Trp 


Gly 


Leu 


Leu 


Ala 


Asn 


Gly 


Arg Met Pro Thr 


Val 


Leu 


Gin 


Cys 


Val 


Asn 


Val 


Ser 


Val 


Val 


Ser 


Glu 


Glu Val Cys Ser 


Lys 
























SEQ ID NO: 20 


























Met Val Glu Ala 


Ser 


Leu 


Ser 


Val 


Arg 


His 


Pro 


Glu 


Tyr 


Asn 


Arg 


Pro 


Leu Leu Ala Asn 


Asp 


Leu 


Met 


Leu 


He 


Lys 


Leu 


Asp 


Glu 


Ser 


Val 


Ser 


Glu Ser Asp Thr 


He 


Arg 


Ser 


He 


Ser 


He 


Ala 


Ser 


Gin 


Cys 


Pro 


Thr 


Ala Gly Asn Ser 


Cys 


Leu 


Val 


Ser 


Gly 


Trp 


Gly 


Leu 


Leu 


Ala 


Asn 


Gly 


Arg Met Pro Thr 


Val 


Leu 


Gin 


Cys 


Val 


Asn 


Val 


Ser 


Val 


Val 


Ser 


Glu 



Glu Val Cys Ser Lys 
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SEQ ID NO: 21 

Ala He Ser Ser Gin Val Phe Trp 
Phe Cys Trp Val Glu Val Thr Ala 
Ser Ala Ala Phe Leu Cys Val Cys 
Ser Val Ser Leu Phe Leu Cys Phe 
Cys Phe Ser Gin Phe Leu Ser Leu 
Ser Pro Ser Leu His Leu Ala Met 
Pro Ser Pro Pro Phe Pro Glu Ser 
His Val Ala Gly Val Leu Leu Val 
Ala Gly Val 



Gly Val Lys Ser Tyr Lys Arg He 
Leu Arg Pro His Ala Val Gly Arg 
Val Ser Ala Ser Pro Met Ser Val 
Ser Leu Phe Leu Cys Leu Phe Pro 
Val Val Thr Val Ser Leu Cys Val 
Arg Pro Cys Val Ser Leu Ser Pro 
Pro Ala Leu Pro Phe Pro Leu Ser 
Leu Leu Ser Ala Gly Ala Glu His 



